Evaluating Representations for the Shine-Dalgarno Site in Escherichia coli

نویسندگان

  • Steven Hampson
  • Dennis Kibler
چکیده

Several methods for identifying individual motif instance by exhaustive evaluation of k-mers (k ≤ 10) are applied to the pooled Upstream Regions (USR) of all 4289 Escherichia coli ORFs. Instances of the Shine-Dalgarno (SD) site are readily identified using these methods. Using these motif instances as starting points, various motif representations and training methods, including several new algorithms, are applied to characterize the complete SD motif. Motif representation languages of increasing power give increasingly better characterizations of the SD motif, permitting more SD sites to be reliably identified. In particular, matrix representation is better than IUPAC which is better than k-mer prototype. However, overly powerful representation also results in suboptimal characterization. A variety of matrix techniques using different representations, objective functions and learning methods yield approximately the same motif, providing evidence for the robustness of the result and the effectiveness of the methods. By these measures, about 1/4 of the ORFs have no better than random SD sites. More biologically realistic motif representation languages might further reduce that fraction.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Translational regulation of the L11 ribosomal protein operon of Escherichia coli: mutations that define the target site for repression by L1.

The L11 ribosomal protein operon of Escherichia coli contains the genes for L11 and L1 and is feedback regulated by the translational repressor L1. The mRNA target site for this repression is located close to the Shine-Dalgarno sequence for the first cistron, rp1K (L11). By use of a random mutagenesis procedure we have isolated and characterized a series of point mutations in the L11 leader mRN...

متن کامل

Listeria Monocytogenes La111 and Klebsiella Pneumoniae KCTC 2242: Shine-Dalgarno Sequences

Listeria monocytogenes can cause serious infection and recently, relapse of listeriosis has been reported in leukemia and colorectal cancer, and the patients with Klebsiella pneumoniae are at increased risk of colorectal cancer. Translation initiation codon recognition is basically mediated by Shine-Dalgarno (SD) and the anti-SD sequences at the small ribosomal RNA (ssu rRNA). In this research,...

متن کامل

An efficient Shine-Dalgarno sequence but not translation is necessary for lacZ mRNA stability in Escherichia coli.

The 5' ends of many bacterial transcripts are important in determining mRNA stability. A series of Shine-Dalgarno (SD) sequence changes showed that the complementarity of the SD sequence to the anti-SD sequence of 16S rRNA correlates with lacZ mRNA stability in Escherichia coli. Several initiation codon changes showed that an efficient initiation codon is not necessary to maintain lacZ mRNA sta...

متن کامل

Silent mutations in secondary Shine-Dalgarno sequences in the cDNA of human serum amyloid A4 promotes expression of recombinant protein in Escherichia coli.

The serum amyloid A (SAA) superfamily comprises a number of differentially expressed genes with a high degree of homology in mammalian species. SAA4, an apolipoprotein constitutively expressed only in humans and mice, is associated almost entirely with lipoproteins of the high-density range. The presence of SAA4 mRNA and protein in macrophage-derived foam cells of coronary and carotid arteries ...

متن کامل

CsrA inhibits translation initiation of Escherichia coli hfq by binding to a single site overlapping the Shine-Dalgarno sequence.

Csr (carbon storage regulation) of Escherichia coli is a global regulatory system that consists of CsrA, a homodimeric RNA binding protein, two noncoding small RNAs (sRNAs; CsrB and CsrC) that function as CsrA antagonists by sequestering this protein, and CsrD, a specificity factor that targets CsrB and CsrC for degradation by RNase E. CsrA inhibits translation initiation of glgC, cstA, and pga...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002